A Boolean measure of similarity

نویسندگان

  • Martin Anthony
  • Peter L. Hammer
چکیده

We propose a way of measuring the similarity of a Boolean vector to a given set of Boolean vectors, motivated in part by certain data mining or machine learning problems. We relate the similarity measure to one based on Hamming distance and we develop from this some ways of quantifying the ‘quality’ of a dataset. Acknowledgements: Part of this work was carried out while Martin Anthony was visiting RUTCOR, Rutgers University. Martin Anthony’s work is supported in part by the IST Programme of the European Community, under the PASCAL Network of Excellence, IST2002-506778. The authors thank Alex Kogan, Ersoy Subasi, Mine Subasi, and Ben Veal for useful discussions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Boolean Similarity Measures for Resource

We develop a new method to rank the degree of similarity between Boolean expressions contrast it with other known methods and describe its implementation Our method reduces time and space complexity from exponential to polynomial in the number of Boolean terms Index Terms Boolean query information retrieval ranking resource discovery similarity measure

متن کامل

Modal similarity

Just as Boolean rules define Boolean categories, the Boolean operators define higher-order Boolean categories referred to as modal categories. We examine the similarity order between these categories and the standard category of logical identity (i.e. the modal category defined by the biconditional or equivalence operator). Our goal is 4-fold: first, to introduce a similarity measure for determ...

متن کامل

L-Fuzzy Valued Inclusion Measure, L-Fuzzy Similarity and L-Fuzzy Distance

The starting point of this paper is the introduction of a new measure of inclusion of fuzzy set A in fuzzy set B. Previously used inclusion measures take values in the interval [0,1]; the inclusion measure proposed here takes values in a Boolean lattice. In other words, inclusion is viewed as an Lfuzzy valued relation between fuzzy sets. This relation is reflexive, antisymmetric and transitive,...

متن کامل

Analysis of Case-Based Representability of Boolean Functions by Monotone Theory

Classi cation is one of major tasks in case-based reasoning(CBR) and many studies have been done for analyzing properties of case-based classi cation [1, 14, 10, 15, 12, 9, 13, 7]. However, these studies only consider numerical similarity measures whereas there are other kinds of similarity measure for di erent tasks. Among these measures, HYPO system [2, 3] in a legal domain uses a similarity ...

متن کامل

An improved similarity measure of generalized trapezoidal fuzzy numbers and its application in multi-attribute group decision making

Generalized trapezoidal fuzzy numbers (GTFNs) have been widely applied in uncertain decision-making problems. The similarity between GTFNs plays an important part in solving such problems, while there are some limitations in existing similarity measure methods. Thus, based on the cosine similarity, a novel similarity measure of GTFNs is developed which is combined with the concepts of geometric...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 154  شماره 

صفحات  -

تاریخ انتشار 2006